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GOODNE-SS  OF  FIT  THSTS 


WITH  SPHCIAL  RFFl-RENCG  TO  TP.STS  FOR  EXPONENTIAf.riT 

1.  In  recent  years  there  have  been  big  advances  in  goodness-^of-f it 

2 

testing;  famous  tests  like  X and  those  based  on  the  empirical  distribution 
function  or  on  spacings  have  been  improved  or  extended,  particularly 
to  deal  with  nuisance  parameters;  new  tests  have  been  proposed  for  special 
situations  like  testing  for  normality  and  exponentiality;  modern  computer 
techniques  have  enabled  new  examinations  of  older  statistics  like  bj^f^nd 
measuring  skewness  and  kurtosis. 

We  wish  to  give  a guide  to  some  of  these  advances,  following  some 
principles  which  we  hope  are  based  on  practical  considerations. 

2.  The  problem  is  as  follows;  a random  sample  of  n values  of  x is 
given,  and  wc  wish  to  test  the  null  hypothesis,  that  the  distribution 
function  of  x , G(x)  , is  a given  distribution 

Hg:  G(x)  = F(x;e) 

whore  0 is  a vector  of  parameters  which  may  be  partly  or  wholly  unknown. 
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5.  Historically,  the  classical  test  is  Pearson's  X , usually  called 
chi-square;  although  of  long  history,  much  work  is  still  being  done  on 
this  statistic.  It  is  a natural  statistic  for  discrete  distributions, 
but  when  F(x;0)  is  continuous,  one  loses  information  by  grouping;  however, 
the  test  is  easily  adapted  for  use  when  0 is  unknown,  or  part  of  9 is 
unknown.  The  adaptation  Implies  a method  of  estimating  parameters  wliich 


1 


is  often  not  followed  in  practice,  and  also  uses  asymptotic  results  for 
finite  samples,  and  errors  are  introduced  in  this  way;  nevertheless,  the 
statistic  is  well  established  in  the  user’s  lexicon, 
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4.  There  are  two  other  assets  for  X which  can  be  turned  into  principles 
for  practical  use. 

Principle  1.  The  user  likes  to  keep  close  to  the  original  data;  he  does 
not  want,  if  it  is  avoidable,  to  make  an  elaborate  transformation  of  his 
x-values  to,  say,  new  z-values,  which  are  then  used  in  calculating  a test 
statistic,  but  which  as  a data  set  mean  little  to  him. 

Principle  2.  l^hen  a test  statistic  is  significant,  the  user  will  want  to 
interpret  this  in  terms  of  some  irregularity  in  the  original  data,  which 
is  suggested  by  the  form  of  the  test  statistic. 
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In  the  X test,  one  sees  the  original  data  on  a line,  and  only 
grouping  is  involved;  thus  Principle  1 is  observed.  With  reference  to 
Principle  2,  a high  value  of  the  statistic  can  usually  be  seen  to  come  from 
one  or  two  cells  and  the  irregularity  in  the  data  is  at  once  pinpointed, 

5.  Over  the  years,  other  methods  of  testing  have  been  introduced.  An 
important  problem  is  always  how  to  handle  nuisance  parameters.  We  shall 
discuss  only  tests  for  continuous  distributions,  for  which  chi-square  is  not 
so  naturally  suited,  and  will  begin  with  two  genera]  types  of  tost:  ODF 
statist ics  and  regression  tests . 

(■'.  r.DF  statistics  arc  the  oldest  historically;  the  fir':t  was  the 

Kolmogorov  (-Sr.iirnov)  statistic.  liPF  refers  to  the  empirical  distribution 

function  F (x)  : 
n 
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„ , , number  of  x-valucs  S j; 

, 

and  EOF  statistics  are  based  on  a measure  of  the  discrepancy  between 
F^Cx)  and  F(;x;0)  . Tiiese  are  both  graphed;  and,  loosely  speaking,  D , 
the  Kolnogorov-Smirnov  statistic,  is  the  largest  vertical  gap  between  the 
curves.  More  precisely,  let  F (x)  refer  to  FCx;6)  w’ith  0 known,  or 
with  estimates  of  0 inserted  where  necessary.  Then 


= sup  {F  (x)  - FCx)},  D"  = sup  {FCx)  - } 

and.  D = (D‘",D"). 

A related  statistic  is  C^^uiper’s)  V = D + D 

Other  statistics  measure  the  discrepancy  in  other  ways;  three  of  the 
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most  important  are  the  Cramer-von  Mises  W , Watson  U , and  Anderson- 


Darling  A*"  : 


v;^  = n 


{f^(x)  - FCx)r  dF(x)  , 


ii2 

U --  n 


{f  (x;) 


(p^Cy)  - F(y))  ciFCy)}  dFCx) 


= n I”  wCx)  {F^(x)  - FCx)}^  dFCx)  , 

iv.’ith 

W(x)  = PCX)}  • 

In  order  to  fix  our  ideas,  and  because  we  wish  to  concentrate  later 
on  this  tost,  let  us  suppose  that  the  h\-pothesis  to  test  is  that  n values 
(if  X constitute  a random  sninplc  from  the  exponential  distribution: 


FCx;0)  = 1 - cxpC-BCx-a))  , x > a ; 
a:'.;l  one  or  l)Ot!i  may  he  unknown.  When  0 is  known,  asymptotic  and 
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much  flniLo-n  null  disti-ibuti.on  theory  is  known  for  the  above  statistics,  and 

they  have  slowly  co:,ia  into  use.  Undoubtedly,  widen  use  was  l\cld  up  because 

in  the  important  situatioiis  wliere  0 was  not  known,  the  distribution  theory 

'was  lacking.  In  very  recent  years  this  has  been  somewhat  filled  in,  and  hDF 

statistics  can  now  be  used  for  tests  for  the  following  important  distributions; 
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(a)  the  normal,  with  p and/or  o unknown, 

(b)  the  exponential  distribution,  P unknown,  a known. 

It  is  of  course  easy  to  test  for  the  exponential  distribution  when  a is 
also  not  known;  the  smallest  number  of  the  sample  is  subtractf'd  in  turn  from 
all  the  others  and  the  resulting  set  is  tested  for  exponent iality  with  B 
unknown  and  a equal  to  zero.  The  above  tests  are  in  the  literature,  and  arc 
described  in  Stephens  (1974b).  Also,  significance  points  have  recently  been 
provided  for  tests  for: 

(c)  the  Gamma  distribution,  scale  parameter  unknown  (Pettit t and 
Stephens,  1976), 

(d)  the  extreme  value  distribution,  scale  and/or  location  parameters 
unknown  (Stephens,  1976b). 

In  summary  of  the  theoretical  side,  asymptotic  distributions  of  EOF 

statistics  do  not  depend  on  6 if  this  involves  location  and  scale  parameters. 

2 2 2 

Asymptotic  points  have  been  calculated  for  W , U and  A (Stephens,  1976a; 
Durbin,  Knott  and  Taylor,  1975),  and  recently  progress  has  been  made  for  finite 
n , for  tlie  test  for  cxponentiality,  by  Durbin  (1975)  and  Margolin  and  Maurer 
(1976).  But  in  general,  points  for  all  statistics,  for  finite  n,  have  been  found 
by  Monte  Carlo  methods.  These  must  be  extrapolated  to  give  asymptotic  points  for 
Ai  D and  V . For  all  the  statistics  Stephens  (1974b;  also  given  in  Table  54 
of  Pearson  and  Hartley,  1972)  has  provided  correction  factors  which  enable  the 
statist le.s  to  be  used  with  only  the  asymptotic  points.  Many  other  references 
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to  work  ill  this  area  arc  in  Stephens  (iy/4h,  iy76a) ; for  an  overall  summary 
of  the  (-.Dl’  and  its  properties  see  Durbin  (197d). 

Advantages  of  LDF  statistics  are; 

(a)  they  follow  Principles  1 and  2; 

(b)  they  are  easily  computed  (one  formula  for  all  n ); 

(c)  over  a wide  range  of  alternatives  they  are  more  powerful  than 

2 - 9 

X (probably  because  of  grouping  in  X”  ) > 

(d)  they  provide  consistent  tests;  if  x in  fact  comes  from  G(x) 

which  is  not  F(x;0]  , then  as  n •>  «>  , ^n^^^  statistics 

will  be  declared  significant. 

Points  (c)  and  (d)  can  be  included  as  two  new  principles: 

Principle  5.  A test  should  be  consistent  and  unbiased. 

Principle  4.  A test  should  be  powerful  over  a wide  range  of  alternatives. 

Clearly  we  cannot  find  a test  which  is,  in  general,  most  powerful  (or  even 
powerful)  over  all  alternatives  to  F(x;0)  ; and  since  the  alternative 
is  often  not  very  precisely  known,  we  establish  Principle  4,  to  cover  a 
wide  range  of  possibilities.  It  may  be  modified  somewhat  to  demand  good  power 
over  only  certain  classes  of  alternatives;  e.g.,  in  a test  for  exponentiality, 
one  may  want  power  only  against  heavy-tailed  distributions. 

7.  Regression  tests.  These  are  especially  suitable  for  situations  where 
unknown  parameters  are  location  and  scale  parameters,  and  have  been  developed 
for  tests  for  normality  and  exponentiality,  particularly  by  Shapiro  and 
Will;  (196S,  1972).  Die  test  is  based  on  a regression  of  the  order  statistics  of 
thi!  sample  against  tlie  expected  values  of  order  statistics  from  .some  canonical 
\ersion  of  tlie  distribution  tested.  Thus  suppose  , for  i from  1 to 

M , arc*  tlie  expected  values  of  order  statistics  from  N(0,1)  ; and  let  the 
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test  bo  that  an  x-set  comes  from  N(b,  ) , b and  o'  unknown.  Then 

supt)ose  we  order  x,  S < ■•*  < x . We  have 

(1)  E(x^)  = y + am. 

and  a regression  of  on  should  be  a straight  line. 

Tl^e  x^  are  correlated  because  they  have  been  ordered;  but  using 
generalized  least  squares,  an  estimate  of  O can  be  found  as  the  estimated 
slope  of  the  line.  This  is 

a = ~ = a'x  , 

say,  where  m is  the  vector  of  m.  , x the  vector  of  x.  , and  V the 

~ 1 ~ i "• 

covariance  matrix  of  normal  order  statistics.  The  test  statistic  W 

''2  2 

depends  essentially  on  a comparison  of  a with  s , the  usual  sample 
variance.  From  the  theoretical  point  of  view,  many  problems  are  posed  by 
W , both  with  distribution  theory,  even  asymptotic,  and  by  the  fact  that 
V is  not  known  for  high  n ; nevertheless,  in  practice,  Shapiro  and  Wilk 
gave  approximate  values  of  a above,  for  n 5 50  . For  n > 50  , 

Shapiro  and  Francia  (1972)  replace  m’V  ^ by  m'  ; for  both  statistics, 
null  distribution  points  are  given  by  Monte  Carlo  methods.  A disadvantage 
in  the  test  for  normality  is  that  it  needs  a different  a for  each  n . 

Dc  Wet  and  Venter  (1973)  have  investigated  the  statistics  arising  when  we 
regress  x on  h where  lu  , the  i-th  component  of  h , is  the  inverse  of 
the  standard  normal  distribution  evaluated  at  i/ (n+1)  ; this  is  "close  to" 
m.  but  not  equal  to  it.  I'or  theoretical  purposes  h is  easier  to  work 
wltli,  and  for  example,  dc  Wet  and  Venter  have  foutid  the  asymptotic  distribution 
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ol"  the  r.t:‘.tistic  correspondin"  to  V/  . LaBrccque  (1972)  lias  extended  the 
model  (1)  to 

ECx^)  = p + am^  + 62W2Cin^)  + ^3''^3Cni^)  ... 

where  w^Cm^)  is  a quadratic  and  a cubic  in  , and  used 

regression  estimates  of  8^  and  8^  to  gives  tests.  Power  is  increased 
again  for  most  alternatives,  but  the  price  is  extensive  computation;  also 
one  begins  to  get  further  from  straightforward  interpretation  of  the  test 
statistics.  Stephens  (1975)  also  has  investigated  this  model. 

8.  Advantages  and  disadvantages  of  V/.  The  W type  of  statistic  certainly 

satisfies  Principles  1 and  2;  the  regression  picture  is  informative.  For 

the  normal  tost,  it  has  high  power  (not  so  overwhelmingly  better  than  EOF 

statistics  as  at  first  asserted,  see  Stephens  (1974b),  but  nevertheless  on  the 
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whole  marginally  better  than  W or  A -,  and  these  in  turn  are  much  better  than 
2 

D or  X )'  0^-  other  hand,  W is  more  difficult  to  compute,  and 

another  disadvantage  is  the  lack  of  mathematical  tlieory  referred  to  above. 

Of  course,  this  may  be  rectified  in  due  course.  Connected  with  this  is  a 
greater  difficulty;  the  W technique  may  not  be  consistent.  The  corresponding 
tost  for* the  exponential  distribution  above  is  based  on  a tost  statistic 

n(x  - X )^ 

v;  ^ , 

(n  - 1){E  x7  - nx  } 

i ^ 

and  Shapiro  and  V.’ilk  have  presented  this  test  and  given  Monte  Carlo  percentage 
points.  Sarkadi  (197S)  has  recently  cast  doubt  on  the  consistency  of  the 
Siiapi  ro-V;i  11;  tests  for  noi-mality  and  exponential  ity,  though  lie  affirms  the 
con;',  i stcncy  of  the  Shapiro-Francla  (1972)  test.  Sarkadi  gives  a version  of 


iBII|||||piP|jPPPI^WI|PIW^  VI...  I ■ 111. . ..  I !,■  .1  ,1  I I II,-  

which  lui  states  is  consistent,  but  no  percentage  points.  It.  would  seem 
fluit  more  work  is  still  needed  on  the  consistency  of  these  procedures,  and 
indeed  of  other  statistics. 

9.  Before  turning  to  tests  for  exponentiality  in  more  detail,  we  point 
out  other  important  lines  of  recent  research. 

(a)  In  connection  with  eliminating  nuisance  parameters  before  a test 

can  be  applied,  O'Reilly  and  Quesenberry  (1973)  have  a method,  the  conditional 
probability  integral  transformation,  which  is  elegant  and  of  considerable 
mathematical  interest;  however,  quite  extensive  computations  arc  heeded. 

(b)  Watson,  and  later  Durbin  and  co-workers,  and  Stephens,  have 

2 

developed  ^ ~ as  a Fourier  series;  W is  a 

functional  of  and  its  properties  can  be  written  in  terms  of  the 

2 2 

Fourier  coefficients.  Similar  ivork  was  done  also  on  U and  A by  these 
authors.  (D\irbin  and  Knott,  1972;  Durbin,  Knott,  and  Tjiylor  1975; 

Stephens,  1974a). 

Asymptotic  distributions,  and  also  asymptotic  power,  can  be  found  for 
certain  alternatives  (tending,  of  course,  to  the  null  as  n ->■  " ) . Durbin 
and  his  cc-workers  also  propose  use  of  the  low-order  Fourier  coefficients 
(which  they  call  components)  as  test  statistics,  and  show  these  to  be  more 
powerful,  in  certain  circumstances,  than  the  entire  statistics  (e.g.,  in  a 
test  of  ll(0,l)  when  the  alternative  is  N(y/*^>  1)  > Y a positive  constant). 
However,  in  Stephens  (1974a)  it  is  shov;n  that  these  circumstances  arc 
restricted,  so  i’:uch  so  that  it  is  difficult  to  recommend  components.  For  example, 
in  a tost  For  N(0,1)  against  N(Yj//n^,  1 »•  Y2/'^j  . component  sensitive  to 
ihc!  value  of  Yj  might  be  totally  unaffected  by  Y^  . From  a practical  point 
of  view,  expcclally  wlion  parameters  must  be  estimated,  they  violate  Principle  1. 
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1h)t-  ox:'.:!!;-)! c' , co:iside:'  ti’c  test,  .for  exponent i. 'i  1 Ity,  a 0 , 3-  iinknoi.n;  one 
first  must  calci’.late,  from  x.  , tlic  value 

= 1 - exp(-x./x)  ; 

then,  for  j - 1,2,. ..,10  , the  values 


y.  = /27n  Z cosfTrj  x.)  ; 
J i=l  ^ 


finally,  the  k-th  component  is  given  by 


"kJ  "j  • 

where  bj,^  are  found  by  metliods  of  Durbin,  Knott  and  Taylor  (1975) . (It 
may  be  that  the  b.  ■ given  in  the  Durbin,  Knott  and  Taylor  paper  are  in  error 
(Stephens,  1976c)). 

Not  only  are  these  techniques  (also  that  of  de  Wet  and  Venter  (1973)) 
mathemvatically  interesting;  they  also  give  impetus  to  questions  concerning 
the  purpose,  efficiency,  and  utility  of  goodness-of-fit  tests,  and  liow  to 
start  measuring  these  concepts. 

(c)  On  very  practical  lines,  recent  work  by  D'Agostino  and  Pearson 
(1973)  and  by  Bowman  and  Shenton  (1975)  has  ercouraged  the  use,  in  tests 
for  normality,  of  the  ske^vness  statistics  b^^  and  the  kurtosis  statistics 
These  are  well  established  statistics;  the  first  authors  gave  new  tables  of 
significance  points,  and  suggested  how  they  might  be  combined  in  an  omnibus 
test  statistic;  however,  they  assumed  independence  of  b^  and  , and 

this  is  not  justlficx’.  Another  approach  to  using  both  /b^  and  b^  is  to 
mark  them  as  a point  in  a 2-dimensional  graph;  if  the  point  falls  outside 
c<'n tours  on  the  gra|)h,  normality  will  be  rejected.  Tliis  is  the 
approacli  of  Bowr-an  and  Shenton  (1975);  it  provides  a tcchniciue  for  using 
two  statistics  simultaneously  to  assess  1!^^  , although  one  can  regard  the 
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contour  as  a function  of  and  which  is  in  effect  a tlurd  statistic 

i)y  which  is  judyed.  The  above  methods  appeal  to  Principle  1;  the 

practical  man  has  a "feel"  for  b^  and  and,  possibly,  for  the  implications 

if  tlie  values  lead  to  rejecting  . 

10.  Tests  for  the  exponential  distribution.  We  novv  return  to  the  exponential 
tost,  to  the  special  case  where  a is  knoivn.  This  may.  be  taken  as  zero;  it 
is  convenient  to  replace  S by  0 so  that  11^  is  that  the  sample  is  from 

(2)  F(x;0)  = 1 - expC-0x)  , x > 0 , 

where  the  scale  parameter  0 is  unknown. 

A multitude  of  test  statistics  have  been  posed  for  this  important  problem; 
not  only  does  the  distribution  arise  in  many  statistical  problems,  but  it 
also  appears  prominently  in  the  theory  of  renewal  processes,  in  particular, 
the  Poisson  process.  If  events  are  random  in  time,  we  expect  the  intervals 
between  them  to  be  exponential.  Important  questions  arc  then  whether  this 
is  so,  against  a more  j'eneral  Gamma  alternative,  say,  or  whether  they  are 
exponential,  but  0 is  changing.  Some  test  statistics  have  been  devised 

p 

with  these  applications  in  mind. 

We  have  already  mentioned  (a)  EOF  statistics  and  (b)  Shapiro-Wilk  W^; 
as  available  tests.  Most  other  tests  exploit  some  interesting  connections 
between  the  exponential  distribution  and  the  uniform.  These  are  briefly 
described  as  follows. 

(a)  Suppose  [not  in  order]  is  an  exponential  sample 

from  F(x;0)  ; let  ordered  sample. 

(!•)}  Su;ipo3C  U j ,U.^,  . . . ,1)^^  is  a random  (i.c.,  unordcred]  sample  from 
IKO.II  , aiul  let  be  the  ordered  sample;  let  be  the  spacings 
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cHo  * 1*0,,  Dj  — 1^2  *"  *■*> 

. Note  that  tlie  spacings  will  not  be  ordered; 
denote  the  oi'dercd  spacings. 

The  G transformation:  uniforms  to  uniforms  . 


= ^(n)"^Cn-l)’ 
lot  D-.-. 


Then  D!  is  another  set  of  unordored  xiniform  spacings  (Sukhatine,  1937) . 
One  can  clearly  build  up  another  ordered  uniform  sample, 


3 

T. 

i=l 


D. 


We  shall  write  U’  = GU  for  this  transformation.  Durbin  (1961)  showed 
that,  loosely  speaking,  G makes  large  spacings  larger  and  small  spacings 
smaller.  It  makes  a "nearly  uniform"  sample  appear  further  removed  from  uniform, 
though  it  cannot  be  repeated  indefinitely;  after  a certain  point,  an  extremely 
non-uniform  sample,  i.e.  very  uneven  spacings,  is  transformed  to  appear  more 

uniform  than  before. 

(d)  The  J transformation:  exponentials  to  uniforms.  Tut  the 
successive  (unordered,  exponential)  along  a line  in  sequence.  Let 

the  total  X = Z . Divide  by  Z to  get 

3 

X./Z  , j = l,...,n-l  . 

'llion  (well  knov.n) ; are  ordered  uniforms;  note  there  are  only  n - 1 

of  them.  V.'e  write  U JX  . 
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■X- 


X,  X. 


After 

J iv  1 sk-n  0 
by  Z 


U u ^ u ^ 
CD  (2)^(3) 


“to 


u,  T 

(n-l) 


']  1 


In  the  case  of  renewal  processes,  the  quantities  shown  with  a 

cross,  will  be  times  of  events  (j-th  blip  on  a screen,  j-th  battery  failure 
(batteries  being  inserted  as  soon  as  one  fails)).  One  can  adapt  the  above 
to  the  situation  where  one  observes  for  fixed  time  , and  obtains  n 
observations  (used  batteries)  in  , and  time  left  over.  Divide  by 
and  the  are  n uniforms. 

(c)  The  K transformation:  exponentials  to  uniforms. 

(1)  Start  with  X (exponentials);  find  ” = JX  . Note: 

U are  ordered  uniforms;  but  their  sizes  depend  on  the 
original  random  ordering  of  the  X^^  . 

(2)  Apply  G to  get  U'  = GU  . U’  are  ordered  uniforms;  the 
combined  U = GJX  gives  the  same  set  of  U’  no  matter 
what  the  original  X-ordcring.  A formula  can  be  found  to 
give  Uj,  directly  from  the’  ; we  write  U’  = KX  (=  GJX) 
(Seshadri , Csorgo  and  Stephens,  1969). 

C^)  The  N transformation:  exponentials  to  exponentials.  We  can 

I I 

complete  the  circle  by  returning  the,  to  exponentials  X^  , and  these 
can  be  found  directly  from  the  X^  - Let  X^^^  = 1 ; 


X,  = (n  + 1 - i)(X 


(i) 


hi-n’ 


i = 1 , . . . ,n 


V.'c  write  X'  - NX  ; the  X'  are  exponential,  parameter  0 , i.c.  from 
distribution  (2). 


1? 


N’otc  that  tho  transCormations  J and  1C  reduce 


^ ^ tests  ■ 

an  exponetttial  sample  to  ordered  uniforms;  in  so  doing  they  eliminate  the 
unknown  parameter  0 . Conversely,  properties  of  the  uniform  distribution 
can  thus  be  used  to  give  estimates  of  0 , including  estimates  from  only 
the  first  r values  of  X , i.e.  a censored  sample.  The  maximum  likelihood 
estimate  from  a complete  sample,  used  in  EOF  tests,  is  1/X  . 

Once  the  exponentials  have  been  changed  to  uniforms,  tests  for 
uniformity  can  be  directly  applied.  Sometimes  this  is  the  way  the  tests 
are  proposed;  at  other  times  the  tests  are  given  in  terms  of  the  original 
exponentials.  Tliese  connections  are  pointed  out  in  order  to  try  to  see 
the  relations  between  many  of  the  proposed  statistics. 

1 2 . Proposed  tests  for  exponentiality. 

The  following  are  some  proposed  techniques  for  testing  for  exponentiality, 
roughly  graded  into  groups. 

Group  A - Direct  use  of  the  EPF, 

2 

(1)  EDF  statistics:  direct  estimation  of  0 and  use  of  D,  V,  IV  , 

2 2 

U or  A , say.  1-tail  test 

(2)  Srinivasan  I)  : closely  related  to  D above  and  asymptotically 

equivalent  to  it  (Moore,  1973);  uses  Rao-Blackwell  theorem  for  bettor 
estimate  of  F(x;0)  . 1-tail  test 

2 

(5)  Finkclstcin  and  Schafer,  S*  : an  EDF  test,  similar  to  W in  (1); 
s*  - r.  6.  where  6.  = max[  If(X^.  ^ ;0)  - (i-l)/n  [ . jp  (X^..  ^ ;0)  - i/n(]  . 

1-tail  test 


Monte  Carlo  points  arc  given  for  S*  . 


Group  B - i>.c\groi>sion. 

(4)  Modified  Shapiro-Wilk  (Stephens,  1977) 


v;*  = 


(z  x.y 


n{nH-l)  Z Xj  - (Z  x^)^} 


distributed  like  W..  for  n + 1 observations. 

c 


2-tail  test 


(The  test  for  origin  "known”,  as  given  by  Shapiro  and  VJilk  (1972)  is 

strictly  a composite  test,  for  both  exponential  shape  and  a hypothesized 
* 

origin;  W is  the  version  comparable  to  other  tests  being  considered 
here.  For  the  alternatives  below,  it  is  generally  better  than  the  Wp 
test  given  by  Shapiro  and  Wilk  (Stephens,  1977)). 

Group  C - Tests  based  on  transformations  to  uniformity. 


Transformations  J and  K below  refer  to  those  described  in  Section  11. 

If  we  decide  to  produce  uniforms  by  J or  K , we  can  then  follow 
up  with  appropriate  tests  for  unifoi'mity,  e.g. 

(i)  EPF  tests,  or 

(ii)  tests  based  on  spacings ; much  impetus  to  this  established 

line  of  work  was  given  by  Pyke  (1965); 

(iii)  tests  based  on  the  position  of  the  mean  of  the  U or  U' 

(produced  by  J or  K respectively)  or  of  one  order  statistic 

U,  > or  . for  a chosen  r ; 

(r)  (r) 

(iv)  the  Shapiro-Wilk  regression  test.  This  is  now  based  on  the  range 
over  the  standard  deviation  of  the  IJ  (or  II')  sample. 


Some  tests  using  these  techniques  are: 

(.5)  J : followed  by  E'JP  statistics  in  (1)  above  for  testing  for  uniformity. 
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Lewis  (196S)  shows  the  procedure  to  lend  to  inconsistent  tests;  tlri  s 
- was  verified  also  in  Seshadri,  Csor^o  and  Stephens 

(6)  K ; followed  by  l-DF  statistics  for  testing  uniformity,  i,c. 

K , then  D , V , or  1 tail  test 

These  were  considered  by  Seshadri,  Csorgo  and  Stephens  (1969),  and 
are  generally  better  than  (6) . 

(7)  K : followed  by  U'  , i.e.  the  mean  of  U'  . Suggested  by  Lewis 
(1965):  Lewis'  S'  is 

S'  = U’  = 2n  - 2 E i 

where  Z = F,X^  . This  can  be  regarded  as  a test  derived  from  regression  of 
X,.^  on  i..  2 tail  test 

(t-)  

(S)  K : followed  by  fot  suitable  r . First  suggested  by  Lewis 

(1965),  and  later  by  Tiku,  Rai  and  Mead  (1974);  the  latter 
rcconmended  r cy  n/2  . 2 tail  lest 


Gi^oup  D 


n 


Other  tests.  Let  Z = Z X.  , X = Z/n  (the  X.  are  the  original  exponential 

i=l  ^ ^ 


from  (2)).  Let  m.  be  E(X-..)  when  0 = 1;  m.  is  then  Z (n  - j + 1) 

^ ^ j-1 


-1 


(9)  Jackson  (1967): 


T = 


n 

Z m.  X... 

i=i  ' w 


2-tail  test 


This  cm  be  regarded  as  derived  from  a regression  of  on  m^ 


(lOj  Mor.!-  (1947,  1951): 


M = -2  Z In 
i-0 


IK  1 
.X 


.15 


equivalont  to 


n _ 

M*  = -2  X (In  X.  - In  X) 
i=l  ^ 


equivalent  to 


n 

M**  = -2  Z In  nD. 

1 


2 

On  Hq  , M is  distributed  X2^  ♦ 2-tail  test 

M is  related  to  Bartlett’s  test  for  equal  variances.  Moran’s  test 
is  as)'mptotically  nost  poiverful  against  the  general  F-distribution 
alternative  (Shorack,  1972) : 

f(x)  = 9^  exp(-0x)^r(a)  , x > 0 , a > 0 . 


It  is  a strong  test  against  the  Weibull  alternative 

f(x)  = 0(Ox)^  ^ exp(- (0x)^/a)  , x > 0 , a > 0 


(Bartholomew,  1957) . 

^ 2 ^2 

(11)  Bartholomew  (1957):  S = Z (X,/Z)  equivalent  to  S = Z D.  . 

i=l  ^ i=l  ^ 

1-tail  test 

The  second  form  of  S was  discussed  by  Moran  (1947). 

(12)  Gail  and  Gastwirth  (1976):  this  is  a recently  proposed  tost  based  on 

’■n'Pl  = Jj  ’'(dA 

where  r - fnp]  . They  suggest  p = 0.5  . 

(i;<)  Tr;tn;;i’n!(nntion  N (exponentials  to  exponentials),  followed  by  Moran's  M 
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Cousu’erod  by  Upstcln  (1960);  Pcrcho  iukI  R.rii{;cr  (1972)  gave  pov/er 
agaln&t  Weibull  alternatives,  in  the  context  of  arrival  times,  and 
failure  rates. 

15.  Power  studies.  We  have  examined  the  power  of  most  of  these  statistics 
by  Monte  Carlo  methods.  Some  results,  for  n = 10  and  20.,  and  for  lO-s 
tests,  are  given  in  Tables  1 and  2. 

Notes: 

(a)  Powers  are  given  in  percent;  numbers  in  parentheses  indicate  the 
number  (in  lOOO's)  of  M.C.  samples  used  for  the  experiment. 

Cb)  Authors  usually  recommend  whether  the  test  is  one-tail  or  two- 
tail;  this  advice  was  followed,  and  no  two-tail  test  was  adapted  to  one-tail 
in  order  to  improve  the  power  for  a specific  alternative.  Thus  the  test 
is  performed  as  though  the  alternative  \vere  unlcno^^rn;  naturally  one  might 
adapt  the  test  to  one- tail  against  known  alternatives.  This  might  be 
particularly  important  where  the  test  is  for  inter-arrival  times  or  failure 
rates,  and  the  alternative  is  a speeding-up  of  observations,  or  a lower 
(higher)  failure  rate. 

(c)  Mostly  our  results  match  those  given  by  other  authors,  the  exception 
being  ivhere  those  authors  did  adapt  the  tests  as  in  (b)  above. 

14,  Conclusions. 

(a)  The  following  all  give  good  results: 

2 2 

(1)  F.DF  using  estimation  of  0 , and  the  A , K or 
Fi nhelstcin-Schafcr  statistic, 

(2)  Lewis,  Jackson,  Shaplro-Wilk, 
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(3)  Moran, 

2 2 

(-0  K- transform,  using  A , W . 

(b)  Groups  (1)  and  (2)  in  (a)  follow  Principle  1 quite  closely:  the 

F.Dh  picture  is  informative,  the  Lewis  and  Jackson  statistics  are  dependent 
on  the  slopes  of  a plot  of  against  i and  respectively 

(ignoring  covariances) ; the  Shapiro-Wilk  test  uses  a regression  on  m.  , 

including  covariance.  Moran's  statistic  is  perhaps  less  pictorial,  but  if  the 

are  put  sequentially  along  a line  (not  ordered),  the  spacings  should  be 
like  uniform  spacings  and  Moran  tests  this.  It  has  the  merit  of  known  best 
power  against  some  important  alternatives.  The  K transform  is  less  direct: 
though  if  the  mean  of  the  ordered  uniforms  so  generated  is  then  used  as  test 
statistic,  it  leads  directly  to  Lewis’  statistic. 

(c)  If  our  results  for  Moran  are  compared  with  those  of  Fcrcho  and 

Ringer  C1972)  for  the  N transformation  followed  by  Moran^s  statistic  (there 
called  Epstein),  it  seems  that  the  direct  use  of  Moran’s  statistic  is  preferable 

for  the  ’v'eibull  alternative  considered  (i.e.,  it  is  not  helpful  to  use  the  N 

transformation) . 

(d)  The  Kolmogorov-Smirnov  statistic  is  not  good  either  I'fith  EOF  directly 
or  after  the  transformation  K ; similarly  for  the  closely  connected  Srini van’s  D 

(e)  It  is  still  necessary  to  pursue  research  on  consistency.  Two  main 
problems  arc  involved:  (1)  are  given  tests,  say  for  uniformity  when  a transfor- 
mation like  K has  been  made,  themselves  consistent  against  departures  (or  at 
least  a wide  class)  from  uniformity?  (2)  do  several  parent  populations  all  give 
uniformity  under  one  of  these  transformations? 

(f)  The  test  considered  here  has  been  the  test  that  a sample  is  from 
an  exponential  parent  population.  V.’hcn  the  problem  concerns  arrival  times 
or  otlicr  problei;;s  wT.ere  the  exponentials  arrive  sequentially,  there  will 
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hi.!  special  alternatives  which  will  promote  one  tost  over  anotlicr.  It  would 
appear  that  further  work  is  needed  on  the  efficiency  of  tests  for  this 
particular  problem. 
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The  distributions  referred  to  in  the  table  include; 


Beta  (a,  b)  : f(x)  = const,  ^ ^ ^ 

Uniform  : f(x)  = 1;  ^ S.  ^ S.  ^ 

Lognormal  0 : f(x)  = const,  exp  {-(In  x)  /20  } • ^ 0 

Weibull  0 : f (x)  = const.  X®  ^ expC-x®)  ; x >0 

Half-normal  : y is  N(0,1)  , x is  |y|  ; ^ > 0 

Half-Cauchy  : y is  Cauchy,  median  zero; x is  |yl;  X > 0 


Test  statistics  are  abbreviated  as  follows: 

S - Srinivasan 

F-S  - Finkelstein-Schafor 

S-W-S  7.  Shapiro-Wilk,  modified  by  Stephens 

S-C-S  - Seshadri -Csorgo-Stephens 

L - Lewis 

T-R-M  - Tiku-Rai-Mead 

J - Jackson 

M - Moran 
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A general  review  is  given  of  recent  advajnces  in  goodness -of -fit, 
and  some  principles  axe  stated  vdiich  it  is  desirable  that  a goodness - 
of -fit  test  shoul.d  follow.  The  second  part  of  the  paper  deals  with  a 
list  of  proposed  tests  for  the  exponential  distribution,  and  some  of 
the  ways  in  which  they  are  interconnected  are  demonstrated.  The  power 
of  the  tests  is  examined  by  Monte  Carlo  studies  and  two  tables  are 
given. 
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